NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Variational Learning Finds Flatter Solutions at the Edge of Stability

Ghosh, Avrajit; Cong, Bai; Yokota, Rio; Ravishankar, Saiprasad; Wang, Rongrong; Tao, Molei; Khan, Mohammad Emtiyaz; Möllenhoff, Thomas (September 2025, NeurIPS)

Free, publicly-accessible full text available September 18, 2026
SITCOM: Step-wise Triple-Consistent Diffusion Sampling For Inverse Problems

Alkhouri, Ismail; Liang, Shijun; Huang, Cheng-Han; Dai, Jimmy; Qu, Qing; Ravishankar, Saiprasad; Wang, Rongrong (July 2025, International Conference on Machine Learning)

Diffusion models (DMs) are a class of generative models that allow sampling from a distribution learned over a training set. When applied to solving inverse problems, the reverse sampling steps are modified to approximately sample from a measurement-conditioned distribution. However, these modifications may be unsuitable for certain settings (e.g., presence of measurement noise) and non-linear tasks, as they often struggle to correct errors from earlier steps and generally require a large number of optimization and/or sampling steps. To address these challenges, we state three conditions for achieving measurement-consistent diffusion trajectories. Building on these conditions, we propose a new optimization-based sampling method that not only enforces standard data manifold measurement consistency and forward diffusion consistency, as seen in previous studies, but also incorporates our proposed step-wise and network-regularized backward diffusion consistency that maintains a diffusion trajectory by optimizing over the input of the pre-trained model at every sampling step. By enforcing these conditions (implicitly or explicitly), our sampler requires significantly fewer reverse steps. Therefore, we refer to our method as Step-wise Triple- Consistent Sampling (SITCOM). Compared to SOTA baselines, our experiments across several linear and non-linear tasks (with natural and medical images) demonstrate that SITCOM achieves competitive or superior results in terms of standard similarity metrics and run-time.
more » « less
Free, publicly-accessible full text available July 14, 2026
Learning Dynamics of Deep Matrix Factorization Beyond the Edge of Stability

Ghosh, Avrajit; Kwon, Soo Min; Wang, Rongrong; Ravishankar, Saiprasad; Qu, Qing (May 2025, International Conference on Learning Representations)

Free, publicly-accessible full text available May 1, 2026
Learning Dynamics of Deep Matrix Factorization Beyond the Edge of Stability

Ghosh, Avrajit; Kwon, Soo Min; Wang, Rongrong; Ravishankar, Saiprasad; Qu, Qing (March 2025, International Conference on Learning Representations)

Deep neural networks trained using gradient descent with a fixed learning rate eta often operate in the regime of ``edge of stability'' (EOS), where the largest eigenvalue of the Hessian equilibrates about the stability threshold 2/eta. In this work, we present a fine-grained analysis of the learning dynamics of (deep) linear networks (DLNs) within the deep matrix factorization loss beyond EOS. For DLNs, loss oscillations beyond EOS follow a period-doubling route to chaos. We theoretically analyze the regime of the 2-period orbit and show that the loss oscillations occur within a small subspace, with the dimension of the subspace precisely characterized by the learning rate. The crux of our analysis lies in showing that the symmetry-induced conservation law for gradient flow, defined as the balancing gap among the singular values across layers, breaks at EOS and decays monotonically to zero. Overall, our results contribute to explaining two key phenomena in deep networks: (i) shallow models and simple tasks do not always exhibit EOS; and (ii) oscillations occur within top features}. We present experiments to support our theory, along with examples demonstrating how these phenomena occur in nonlinear networks and how they differ from those which have benign landscapes such as in DLNs.
more » « less
Free, publicly-accessible full text available March 25, 2026
Learning Dynamics of Deep Matrix Factorization Beyond the Edge of Stability

Ghosh, Avrajit; Kwon, Soo Min; Wang, Rongrong; Ravishankar, Saiprasad; Qu, Qing (March 2025, The Thirteenth International Conference on Learning Representations)

Free, publicly-accessible full text available March 5, 2026
Analysis of Deep Image Prior and Exploiting Self-Guidance for Image Reconstruction

https://doi.org/10.1109/TCI.2025.3540706

Liang, Shijun; Bell, Evan; Qu, Qing; Wang, Rongrong; Ravishankar, Saiprasad (February 2025, IEEE Transactions on Computational Imaging)

Free, publicly-accessible full text available February 19, 2026
Image Reconstruction Via Autoencoding Sequential Deep Image Prior

Alkhouri, Ismail; Liang, Shijun; Bell, Evan; Qu, Qing; Wang, Rongrong; Ravishankar, Saiprasad (December 2024, Advances in Neural Information Processing Systems)

Recently, Deep Image Prior (DIP) has emerged as an effective unsupervised one-shot learner, delivering competitive results across various image recovery problems. This method only requires the noisy measurements and a forward operator, relying solely on deep networks initialized with random noise to learn and restore the structure of the data. However, DIP is notorious for its vulnerability to overfitting due to the overparameterization of the network. Building upon insights into the impact of the DIP input and drawing inspiration from the gradual denoising process in cutting-edge diffusion models, we introduce Autoencoding Sequential DIP (aSeqDIP) for image reconstruction. This method progressively denoises and reconstructs the image through a sequential optimization of network weights. This is achieved using an input-adaptive DIP objective, combined with an autoencoding regularization term. Compared to diffusion models, our method does not require training data and outperforms other DIP-based methods in mitigating noise overfitting while maintaining a similar number of parameter updates as Vanilla DIP. Through extensive experiments, we validate the effectiveness of our method in various image reconstruction tasks, such as MRI and CT reconstruction, as well as in image restoration tasks like image denoising, inpainting, and non-linear deblurring.
more » « less
Full Text Available
Improving Training Efficiency of Diffusion Models via Multi-Stage Framework and Tailored Multi-Decoder Architecture

Zhang, Huijie; Lu, Yifu; Alkhouri, Ismail; Ravishankar, Saiprasad; Song, Dogyoon; Qu, Qing (June 2024, Conference on Computer Vision and Pattern Recognition)

Diffusion models, emerging as powerful deep generative tools, excel in various applications. They operate through a two-steps process: introducing noise into training samples and then employing a model to convert random noise into new samples (e.g., images). However, their remarkable generative performance is hindered by slow training and sampling. This is due to the necessity of tracking extensive forward and reverse diffusion trajectories, and employing a large model with numerous parameters across multiple timesteps (i.e., noise levels). To tackle these challenges, we present a multi-stage framework inspired by our empirical findings. These observations indicate the advantages of employing distinct parameters tailored to each timestep while retaining universal parameters shared across all time steps. Our approach involves segmenting the time interval into multiple stages where we employ custom multi-decoder U-net architecture that blends time-dependent models with a universally shared encoder. Our framework enables the efficient distribution of computational resources and mitigates inter-stage interference, which substantially improves training efficiency. Extensive numerical experiments affirm the effectiveness of our framework, showcasing significant training and sampling efficiency enhancements on three state-of-the-art diffusion models, including large-scale latent diffusion models. Furthermore, our ablation studies illustrate the impact of two important components in our framework: (i) a novel timestep clustering algorithm for stage division, and (ii) an innovative multi-decoder U-net architecture, seamlessly integrating universal and customized hyperparameters.
more » « less
Full Text Available
Optimal Eye Surgeon: Finding image priors through sparse generators at initialization

Ghosh, Avrajit; Zhang, Xitong; Sun, Kenneth K; Qu, Qing; Ravishankar, Saiprasad; Wang, Rongrong (June 2024, International Conference on Machine Learning)

We introduce Optimal Eye Surgeon (OES), a framework for pruning and training deep image generator networks. Typically, untrained deep convolutional networks, which include image sampling operations, serve as effective image priors (Ulyanov et al., 2018). However, they tend to overfit to noise in image restoration tasks due to being overparameterized. OES addresses this by adaptively pruning networks at random initialization to a level of underparameterization. This process effectively captures low-frequency image components even without training, by just masking. When trained to fit noisy images, these pruned subnetworks, which we term Sparse-DIP, resist overfitting to noise. This benefit arises from underparameterization and the regularization effect of masking, constraining them in the manifold of image priors. We demonstrate that subnetworks pruned through OES surpass other leading pruning methods, such as the Lottery Ticket Hypothesis, which is known to be suboptimal for image recovery tasks (Wu et al., 2023). Our extensive experiments demonstrate the transferability of OES-masks and the characteristics of sparse-subnetworks for image generation.
more » « less
Full Text Available
Robust Self-Guided Deep Image Prior

Bell, Evan; Liang, Shijun; Qu, Qing; Ravishankar, Saiprasad (April 2023, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))

In this work, we study the deep image prior (DIP) for reconstruction problems in magnetic resonance imaging (MRI). DIP has become a popular approach for image reconstruction, where it recovers the clear image by fitting an overparameterized convolutional neural network (CNN) to the corrupted/undersampled measurements. To improve the performance of DIP, recent work shows that using a reference image as an input often leads to improved reconstruction results compared to vanilla DIP with random input. However, obtaining the reference input image often requires supervision and hence is difficult in practice. In this work, we propose a self-guided reconstruction scheme that uses no training data other than the set of undersampled measurements to simultaneously estimate the network weights and input (reference). We introduce a new regularization that aids the joint estimation by requiring the CNN to act as a powerful denoiser. The proposed self-guided method gives significantly improved image reconstructions for MRI with limited measurements compared to the conventional DIP and the reference-guided method while eliminating the need for any additional data.
more » « less
Full Text Available

« Prev Next »

Search for: All records